SIMD Tree Algorithms for Image Correlation
نویسندگان
چکیده
This paper examines the applicability of fine-grained “pure” tree SIMD machines, which are amenable to highly efficient VLSI implementation, to image correlation which is a representative of low-level image windowbased operations. A particular massively parallel machine called NONVON is used for purposes of explication and performance evaluation. Several algorithms are presented for image shifting and correlation operations. Novel algorithmic techniques are described, such as vertical pipelining, subproblem partitioning, associative matching, and data duplication that effectively exploit the massive parallelism available in finegrained SIMD tree machines. Limitations of SIMD pure tree machines are also addressed. They tend to correspond to situations in which the root of the tree may become a significant communication bottleneck, or in situations in which MIMD techniques would be more effective than the SIMD approaches considered in this paper. Performance results have been projected for the NON-VON machine (using only its tree connections, in order to address the issues of concern in this paper).
منابع مشابه
Speculative Parallel Evaluation Of Classification Trees On GPGPU Compute Engines
We examine the problem of optimizing classification tree evaluation for on-line and real-time applications by using GPUs. Looking at trees with continuous attributes often used in image segmentation, we first put the existing algorithms for serial and data-parallel evaluation on solid footings. We then introduce a speculative parallel algorithm designed for single instruction, multiple data (SI...
متن کاملParallel Generation of t-ary Trees
A parallel algorithm for generating t-ary tree sequences in reverse B-order is presented. The algorithm generates t-ary trees by 0-1 sequences, and each 0-1 sequences is generated in constant average time O(1). The algorithm is executed on a CREW SM SIMD model, and is adaptive and cost-optimal. Prior to the discussion of the parallel algorithm a new sequential generation with O(1) average time ...
متن کاملAdapting Tree Structures for Processing with SIMD Instructions
In this paper, we accelerate the processing of tree-based index structures by using SIMD instructions. We adapt the B+-Tree and prefix B-Tree (trie) by changing the search algorithm on inner nodes from binary search to k-ary search. The k-ary search enables the use of SIMD instructions, which are commonly available on most modern processors today. The main challenge for using SIMD instructions ...
متن کاملParallelization of Tree-Recursive Algorithms on a SIMD Machine
The set of tree-recursive algorithms is large, including constraint satisfaction using backtracking, iterative-deepening search such as IDA*, depth-first branch-and-bound, twoplayer game minimax search, and many divide-and-conquer algorithms. We describe a structured method for implementing such algorithms on SIMD machines, and identify measures for determining if a tree-recursive application i...
متن کاملSimd Optimization of Euclidean Distance Transforms for Pattern Recognition
This paper describes a SIMD optimization method for computing different Euclidean distance algorithms. Distance transforms have been widely applied to image analysis and pattern recognition problems. The proposed approach is based on the inherent fine and medium-grain parallelism of considered distance algorithms and has been implemented using Intel Streaming SIMD Extensions (SSE), intrinsics a...
متن کامل